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Abstract. A prospective for the inclusive search of the Standard Model Higgs boson in the decay 
channel // ^ 77 is presented with the CMS experiment at the LHC. The analysis relies on a 
strategy to determine the background characteristics and systematics from data. The strategy is 
applied to a Monte Model of the QCD background, with full simulation of the detector response. 
The discrimination between signal and background exploits information on photon isolation and 
kinematics. The resolution for the reconstructed Higgs boson mass profits from the excellent energy 
resolution of the CMS crystal calorimeter. A discovery significance above 5 sigma is expected at 
integrated LHC luminosities below 30 fb^' for Higgs boson masses below 140 GeV/c^. 
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INTRODUCTION 

Since the beginning of the LHC project the H yy channel is considered as a major 
discovery channel of Higgs particles [1] at masses between LEP limit 1 14.4 GeV/c^ ^ 
and about 140 GeV/c^. Indeed, despite a high rate of the H ^ bb decays at low mass, 
it remains out of our interest due to large QCD background and low mass resolution of 
the di-jet state. Since the // — > 77 decay involves virtual loops, it has relatively small 
branching fraction of about 10^. However, the signal has a clean signature with two 
high Et isolated photons. Due to an excellent resolution of the CMS electromagnetic 
calorimeter (ECAL), it can be identified as a narrow peak at a Higgs boson mass on the 
top of a continuous background. 

We investigated two analysis methods for the H -^yy searches at CMS JsiH]. In addi- 
tion to the conventional cut-based analysis, we report the optimized discovery oriented 
technique based on a multivariate optimization. The latter exploits a difference in the 
signal and the background kinematics. These studies use a full CMS detector simulation 
program assuming 2 x 10^-^cm^^s^^ machine luminosity and including collision effects 
such as minimum bias and underling events in the simulation model. 

RATES AND CROSS SECTIONS 

The inclusive search of the Higgs boson implies any production mechanism. In proton- 
proton collisions at LHC energy about 80% of the Higgs bosons are produced in the 
gluon fusion reaction, while the rest are produced in association with either qq pairs 



(WVB or tt fusion) or vector bosons. Table [T] presents the cross sections and H yy 
branching fractions for the different Higgs boson mass [6]. 

TABLE 1. Signal NLO cross sections and branching ratios. 



Mh (GeV/c2) 


115 


120 


130 


140 


150 


a gg fusion (pb) 


39.2 


36.4 


31.6 


27.7 


24.5 


a WVB fusion (pb) 


4.7 


4.5 


4.1 


3.8 


3.6 


o WH, ZH, tm (pb) 


3.8 


3.3 


2.6 


2.1 


1.7 


Total a (pb) 


47.6 


44.2 


38.3 


33.6 


29.7 


my^-> 77), % 


0.21 


0.22 


0.22 


0.20 


0.14 




99.3 


97.5 


86.0 


65.5 


41.5 



We consider two sorts of the background processes. The "irreducible" background 
has two real high isolated photons. Such a signature can be produced by both 
quark-antiquark annihilation ("born") and gluon-gluon fusion ("box") as well as quark- 
gluon Compton scattering with isolated bremsstrahlung processes. We estimate the 
total differential rate of "irreducible" backgrounds about 100 fb/GeV/c^ at 120 GeV/c^ 
mass. Thus, we require about 1 GeV/c^ two-photon mass resolution for a powerful 
discrimination of the signal (see Table [B. 

The dominant QCD processes like 7+jet and di-jets may lead to the fake photons in- 
duced by neutral hadrons tz^ or 77 and produced in the jet fragmentation processes. The 
"reducible" background has at least one non-isolated photon. The PYTHIA LO cross 
sections of the background processes are presented in Table [21 To compute the back- 
ground yields at NLO cross sections we apply the K-factors summarized in Table [3] 
Thus, a jet suppression has to be better then 10^ to diminish its rate at the level of the 
"irreducible" background. 

DETECTOR 

The conceptual design of the CMS detector [5] exploits the conventional layout of the 
particle detector at hadron colliders. A dedicated electromagnetic calorimeter is required 
for a powerful search of the Standard Model (SM) Higgs boson. Indeed, the H ^ yy 
decay was employed as a benchmark channel in the design of the ECAL. It comprises 
75848 lead tungstate (PWO4) crystals placed in two pseudo-rapidity regions: the barrel 
(I77I < 1.479) and the endcap (1.479 <\r]\< 3.0). The ECAL endcap is equipped by 
a preshower (ES) system for rejection. Due to low Moliere radius (2.19 cm) about 



TABLE 2. LO cross sections for backgrounds. 



Process 


Pt 


<7lo 




(GeV/c) 


(pb) 


PP^ 77 (born) 


>25 


82 


pp— > 77 (box) 


>25 


82 


PP^ 7+jet 


>30 


5 X 10"^ 


pp^jets 


>50 


2.8 X 10^ 


Drell Yan ee 




4 X 10^"* 



TABLE 3. Background K-factors applied 


for the PYTHIA cross section 




Process 


K-factor 


pp^ 77 (born) 


1.5 


at the test-beam, pp^ 77 (box) 


1.2 


pp— > 7+jet (2 prompt) 


1.72 


PP^ 7+jet (1 prompt) 


1 


pp^jets 


1 



80% of the shower energy is deposited in one crystal having 2.2x2.2 cm^ front face size. 
Such a high granularity allows us a powerful identification of isolated photons. 
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FIGURE 1. 
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-beam (GsV) 
Energy resolution of the barrel supermodule. 

Figure [U shows the energy resolution of the ECAL barrel obtained with an electron 
test-beam. The stochastic (5), the noise (A^^) and the constant (C) terms are obtained by a 
fit with the following function: 



+ 



(1) 



Clustering algorithm accounts strong magnetic field and presence of material in front of 
the calorimeter. Measured energy can be computed as 



clusters 



(2) 



where F is a correction function, Gci is a calibration factor expressed as a product 
of a global absolute scale and intercalibration constants, respectively. A, is a signal 



amplitude, in ADC counts. Sum of the incident energy in 5x5 crystals provide the best 
energy estimation for the unconverted photons (F = 1) . The dedicated procedure of 
ECAL calibration is foreseen with the physics events such as W — > ev, Z — > e^e^ and 
%^ yy. 

ANALYSIS AND RESULTS 

Higgs decays into two-photon state are selected with extremely high efficiency both 
Level- 1 (99.7%) and High Level Triggers (88.4%). Due to tighter kinematical and 
isolation critera applied in the analysis, no additional signal restriction by trigger were 
observed. 

Two-photon candidates are selected within fiducial volume \r\\ < 2.5, and > 
40, 35 GeV/c. Since fake photons produced in the jets are accompanied by additional 
energetic particles, we require no tracks with pr > 1-5 GeV/c inside a cone around 
photon with a radius Ai? < 0.3. In addition, we demand Y.Et < 6(3) GeV/c in the ECAL 
barrel (endcap) in a cone 0.06 <Mi< 0.35 and Y.Et < 6(5) GeV/c in the HCAL barrel 
(endcap) in a cone A/? < 0.3. The energy response of individual crystals were smeared 
according to calibration precision expected for an integrated luminosity of 10 fb^ and 
obtained with W ^ ev control sample. 

Due to longitudinal spread of the interaction vertices (~ 50 mm) the di-photon mass 
is smeared by about 1.5 GeV/c^. Indeed, Higgs bosons are produced in association with 
tracks from underlying events, initial state gluon radiation and associative particles qqH, 
WH, ZH. These tracks allow us to identify the interaction vertex, in about 80% of cases, 
and correct momenta of the photons. 

Figure [2] shows the di-photon mass distribution for the signal (scaled by a factor 10) 
and the background events after the selection. For a 120 GeV/c^ Higgs boson, the signal 
efficiency of 30% yields to 29.3 observed Higgs events per inverse femtobam, while the 
total background is 178 fb/GeV/c^ as shown in Table |4l Despite asymmetric lineshape 
about 60% of the signal remains within ±1 GeV/c^ mass window. 

TABLE 4. Background yields in fb/GeV/c^ 

Process Mh, (GeV/c^ ) 





115 


120 


130 


140 


PP^ 77 (born) 


48 


44 


36 


29 


PP~^ 77 (box) 


36 


31 


23 


16 


PP^ 7+jet (2prompt) 


43 


40 


32 


26 


PP^ 7+jet (1 prompt) 


40 


34 


22 


19 


pp^jets 


29 


27 


20 


18 


Drell Yan ee 


2 


2 


1 


1 


Total background 


203 


178 


134 


109 



Including of new variables or cuts optimization do not lead to the further improvement 
of the signal over background ratio (s/b). However, we find the R9, fraction of the super- 
cluster energy deposited a small 3x3 crystal area, as a powerful discriminator of a n'^. 
Indeed, high value of R9 readily identifies converted photons and automatically selects 




M,, (GeV) 

FIGURE 2. Expected di-photon invariant mass distribution for the cut-based analysis normalized to an 
integrated luminosity of 1 fb^' . Signal is scaled by a factor 10. 

against 7lP. The converted category remains background enriched. However, the use of 
a track information would allow further rejection of the background. 

To improve signal significance, we split the analyzed sample into categories with 
different s/b. Asides single category, we consider /oar categories, 2 categories where 
both photons are detected in the barrel and 2 where at least one photon is in the endcap. 
Each of these 2 categories are splitted for high (>0.93) and low (<0.93) values of 
R9. Then, we form twelve categories sample based on 3 ranges in 7?9 (0.9, 0.948) 
and 4 pseudo-rapidity regions in Ir/I (0.9, 1.4, 2.1). Table [5] summarizes the integrated 
luminosity needed to discover or to exclude Higgs boson for the different event splitting. 
Confidence level is calculated with a frequentistic approach using a log-likelihood ratio 
(LLR) estimator. These outcomes are computed for many possible pseudo-experiments 
for a hypothesis when the signal exists and that it does not. 



TABLE 5. Integrated luminosity required for observation 
or exclusion of the Higgs boson with a mass of 120 GeV/c^. 





5(T 


3a 


95% C.L. excl. 


1 category 


24.5 (39.5) 


8.9(11.5) 


4.1 (5.8) 


4 categories 


21.3 (26.0) 


7.5 (9.1) 


3.5 (4.8) 


12 categories 


19.3 (22.8) 


7.0(8.1) 


3.2 (4.4) 



The optimized analysis exploits six categories, 3 categories where both photons are 
detected in the barrel and 3 where at least one photon is in the endcap. These 3 categories 
are defined according to measured R9, as for the cut-based analysis splitted into 12 
categories. Loose cuts are applied for the isolation variables which are used as a neural 
network prior. Fixed cut for the optimized output variable slightly improves a 

background rejection. Asides two-photon mass, four other kinematical variables can be 



used for further discrimination of the signal. They are transverse energy of each photon 
Ej , r\ difference between two photons \r\1~r\2\ and longitudinal momentum of the 
photon pair Pj^ . These variables are combined with NN,5o for the further neural network 
optimization. To reject the background rate a neural net is trained with the mass side- 
band events and results into NN^,„ output variable. Due to negligible correlation between 
invariant mass and NN^tin^ the s/b expectation can be estimated for each event as 



s 



(3) 



kin 



Finally, the events are binned according to the s/b estimate. Then, we exploit \og{s/b) 
distribution to compute the confidence level with the LLR estimator. 
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FIGURE 3. Integrated luminosity required for a 5 a discovery as a function of the Higgs mass. 

Figure[3]shows the integrated luminosity needed for a 5a discovery. For a 120 GeV/c^ 
Higgs boson, such a discovery can occur with about 7.7 fb^^ recorded data using an 
optimized technique, while it is required about 22.8 fb^ of integrated luminosity using 
conventional cut-based analysis technique (see Table [5]). We address major systematic 
errors. The dominant contribution is due to background subtraction. It is evaluated from 
the uncertainty of the fit function in the mass side-bands. 



CONCLUSIONS 



The SM Higgs boson can be readily discovered with an integrated luminosity less then 
30fb \ if its mass is below 140 GeV/c^. We investigated the standard cut-based and 
discovery oriented optimized analysis techniques. The best achievement corresponds 
to a 120 GeV/c^ Higgs boson where 7.7 fb^^ of a recorded data is required for the 
5(7 discovery. The analysis strategy implies data driven methods of the background 
subtraction using a side-band mass regions. There is a potential room for improvement 
by adding the track information to identify converted photons and ECAL preshower to 
reject tjP. 
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